A Statistical Method for an Automatic Detection of Form Types
Identifieur interne : 002019 ( Main/Exploration ); précédent : 002018; suivant : 002020A Statistical Method for an Automatic Detection of Form Types
Auteurs : Saddok Kebairi [France] ; Bruno Taconet [France] ; Abderrazak Zahour [France] ; Said Ramdane [France]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 1999.
Abstract
Abstract: In this paper, we present a method to classify forms by a statistical approach; the physical structure may vary from one writer to another. An automatic form segmentation is performed to extract the physical structure which is described by the main rectangular block set. During the form learning phase, a block matching is made inside each class; the number of occurrences of each block is counted, and statistical block attributes are computed. During the phase of identification, we solve the block instability by introducing a block penalty coefficient, which modifies the classical expression of Mahalanobis distance. A block penalty coefficient depends on the block occurrence probability. Experimental results, using the different form types, are given.
Url:
DOI: 10.1007/3-540-48172-9_8
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 002688
- to stream Istex, to step Curation: 002509
- to stream Istex, to step Checkpoint: 001570
- to stream Main, to step Merge: 002128
- to stream Main, to step Curation: 002019
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">A Statistical Method for an Automatic Detection of Form Types</title>
<author><name sortKey="Kebairi, Saddok" sort="Kebairi, Saddok" uniqKey="Kebairi S" first="Saddok" last="Kebairi">Saddok Kebairi</name>
</author>
<author><name sortKey="Taconet, Bruno" sort="Taconet, Bruno" uniqKey="Taconet B" first="Bruno" last="Taconet">Bruno Taconet</name>
</author>
<author><name sortKey="Zahour, Abderrazak" sort="Zahour, Abderrazak" uniqKey="Zahour A" first="Abderrazak" last="Zahour">Abderrazak Zahour</name>
</author>
<author><name sortKey="Ramdane, Said" sort="Ramdane, Said" uniqKey="Ramdane S" first="Said" last="Ramdane">Said Ramdane</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9</idno>
<date when="1999" year="1999">1999</date>
<idno type="doi">10.1007/3-540-48172-9_8</idno>
<idno type="url">https://api.istex.fr/document/8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002688</idno>
<idno type="wicri:Area/Istex/Curation">002509</idno>
<idno type="wicri:Area/Istex/Checkpoint">001570</idno>
<idno type="wicri:doubleKey">0302-9743:1999:Kebairi S:a:statistical:method</idno>
<idno type="wicri:Area/Main/Merge">002128</idno>
<idno type="wicri:Area/Main/Curation">002019</idno>
<idno type="wicri:Area/Main/Exploration">002019</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">A Statistical Method for an Automatic Detection of Form Types</title>
<author><name sortKey="Kebairi, Saddok" sort="Kebairi, Saddok" uniqKey="Kebairi S" first="Saddok" last="Kebairi">Saddok Kebairi</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique du Havre, Université du Havre, Place Robert Schuman, 76610, Le Havre</wicri:regionArea>
<placeName><region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Le Havre</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
<author><name sortKey="Taconet, Bruno" sort="Taconet, Bruno" uniqKey="Taconet B" first="Bruno" last="Taconet">Bruno Taconet</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique du Havre, Université du Havre, Place Robert Schuman, 76610, Le Havre</wicri:regionArea>
<placeName><region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Le Havre</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
<author><name sortKey="Zahour, Abderrazak" sort="Zahour, Abderrazak" uniqKey="Zahour A" first="Abderrazak" last="Zahour">Abderrazak Zahour</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique du Havre, Université du Havre, Place Robert Schuman, 76610, Le Havre</wicri:regionArea>
<placeName><region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Le Havre</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
<author><name sortKey="Ramdane, Said" sort="Ramdane, Said" uniqKey="Ramdane S" first="Said" last="Ramdane">Said Ramdane</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique du Havre, Université du Havre, Place Robert Schuman, 76610, Le Havre</wicri:regionArea>
<placeName><region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Le Havre</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>1999</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9</idno>
<idno type="DOI">10.1007/3-540-48172-9_8</idno>
<idno type="ChapterID">8</idno>
<idno type="ChapterID">Chap8</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: In this paper, we present a method to classify forms by a statistical approach; the physical structure may vary from one writer to another. An automatic form segmentation is performed to extract the physical structure which is described by the main rectangular block set. During the form learning phase, a block matching is made inside each class; the number of occurrences of each block is counted, and statistical block attributes are computed. During the phase of identification, we solve the block instability by introducing a block penalty coefficient, which modifies the classical expression of Mahalanobis distance. A block penalty coefficient depends on the block occurrence probability. Experimental results, using the different form types, are given.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
</country>
<region><li>Haute-Normandie</li>
<li>Région Normandie</li>
</region>
<settlement><li>Le Havre</li>
</settlement>
</list>
<tree><country name="France"><region name="Région Normandie"><name sortKey="Kebairi, Saddok" sort="Kebairi, Saddok" uniqKey="Kebairi S" first="Saddok" last="Kebairi">Saddok Kebairi</name>
</region>
<name sortKey="Kebairi, Saddok" sort="Kebairi, Saddok" uniqKey="Kebairi S" first="Saddok" last="Kebairi">Saddok Kebairi</name>
<name sortKey="Ramdane, Said" sort="Ramdane, Said" uniqKey="Ramdane S" first="Said" last="Ramdane">Said Ramdane</name>
<name sortKey="Ramdane, Said" sort="Ramdane, Said" uniqKey="Ramdane S" first="Said" last="Ramdane">Said Ramdane</name>
<name sortKey="Taconet, Bruno" sort="Taconet, Bruno" uniqKey="Taconet B" first="Bruno" last="Taconet">Bruno Taconet</name>
<name sortKey="Taconet, Bruno" sort="Taconet, Bruno" uniqKey="Taconet B" first="Bruno" last="Taconet">Bruno Taconet</name>
<name sortKey="Zahour, Abderrazak" sort="Zahour, Abderrazak" uniqKey="Zahour A" first="Abderrazak" last="Zahour">Abderrazak Zahour</name>
<name sortKey="Zahour, Abderrazak" sort="Zahour, Abderrazak" uniqKey="Zahour A" first="Abderrazak" last="Zahour">Abderrazak Zahour</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002019 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002019 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9 |texte= A Statistical Method for an Automatic Detection of Form Types }}
This area was generated with Dilib version V0.6.32. |